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Methods for Obtaining Plant Varietfes ll : yj 



TECHNICAL FIELD 

The present invention relates to nucleotide sequences which encode polypeptides 
involved in the DNA mismatch repair systems of plants, and to the polypeptides encoded 
5~i?y"those^nucleotide-*sequenees_ — ^^e-invention-also-relates-to^nucleotide-sequences^and- 
polypeptide sequences for use in altering the DNA mismatch repair system in plants. The 
invention also relates to a process for altering the DNA mismatch repair system of a plant 
cell, to a process for increasing genetic variations in plants and to processes for obtaining 
plants having a desired characteristic. 

o BACKGROUND OF THE INVENTION 

Plant breeding essentially relies on and makes use of genetic variation which occurs 
naturally within and between members of a family, a genus, a species or a subspecies. 
Another source of genetic variation is the introduction of genes from other organisms 
which may or may not be related to the host plant. 

1 5 Allelic loci or non-allelic genes which constitute or contribute to desired quantitative 
(e.g. growth performance, yield, etc.) or qualitative (e.g. deposition, content and 
composition of seed storage products; pathogen resistance genes; etc.) traits that are 
absent, incomplete or inefficient in a species or subspecies of interest are typically 
introduced by the plant breeder from other species or subspecies, or de novo. This 

20 introduction is often done by crossing, provided that the species to be crossed are sexually 
compatible. Other means of introducing genomes, individual chromosomes or genes into 
plant cells or plants are well known in the art. They include cell fusion, chemically aided 
transfection (Schocher et al., 1986, Biotechnology 4: 1093) and ballistic (McCabe et al., 
1988, Biotechnology 6: 923), microinjection (Neuhaus et al., 1987, TAG 75: 30), 

25 electroporation of protoplasts (Chupeau et al., 1989, Biotechnology 7: 53) or microbial 
transformation methods such as Agrobacterium mediated transformation (Horsch et al., 
1985, Science 227: 1229; Hiei et al., 1996, Biotechnology 14: 745). 

However, when a foreign genome, chromosome or gene is introduced into a plant, 
it will often segregate in subsequent generations from the genome of the recipient plant or 
so pl ant cell dining mitotic and mciotic cell divisions and, in consequ e nce, become lost from 

thp-hnyf-pHnt -nr-plflnf-PelUnto-whirhJ^ 

introduced genome, chromosome or gene physically combines entirely or in part with the 
genome, chromosome or gene of the host plant or plant cell in a process which is called 
recombination. 

35 Recombination involves the exchange of covalent linkages between DNA molecules 
in regions of identical or similar sequence. It is referred to here as homologous 
recombination if donor and recipient DNA are identical or nearly identical (at least 98% 
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base sequence identity), and as homeologous recombination if donor aMi recipient J*>1H[£* 
are not identical but are similar (less than 98% base sequence identity). 

The ability of two genomes, chromosomes or genes to recombine is known to 
depend largely on the evolutionary relation between them and thus on the degree of 
sequence si milarity between the two DNA molecules. Whereas homologous 
recombination is frequently observed during mitosis and meiosis, homeologous 
recombination is rarely or never seen. 

From a breeder's perspective, the limits within which homologous recombination 
occurs, therefore, define a genetic barrier between species, varieties or lines, in contrast 

10 to homeologous recombination which can break this barrier. Homeologous recombination 
is thus of great importance for plant breeding. Accordingly there is a need for a process 
for enhancing the frequency of homeologous recombination in plants. In particular, there 
is a need for a process of increasing homeologous recombination to significantly shorten 
the length of breeding programs by reducing the number of crosses required to obtain an 

1 5 otherwise rare recombination event. 

At least in Escherichia coli, homologous and homeologous recombination are 
known to share a common pathway that requires among others the proteins RecA, RecB, 
RecC, RecD and makes use of the SOS induced RuvA and RuvB, respectively. It has 
been suggested that mating induced recombination follows the Double-Strand Break 

20 Repair model (Szostak et al., 1983, Cell 33, 25-35), which is widely used to describe 
genetic recombination in eukaryotes. Following the alignment of homologous or 
homeologous DNA double helices the RecA protein mediates an exchange of a single 
DNA strand from the donor helix to the aligned recipient DNA helix. The incoming 
strand screens the recipient helix for sequence complementarity, seeking to form a 

25 heteroduplex by hydrogen bonding the complementary strand. The displaced homologous 
or homeologous strand of the recipient helix is guided into the donor helix where it base 
pairs with its counterpart strand to form a second heteroduplex. The resulting branch 
point then migrates along the aligned chromosomes thereby elongating and thus stabilising 
the initial heteroduplexes. Single stranded gaps (if present) are closed by DNA synthesis. 

30 The strand cross overs (Holliday junction) are eventually resolved enzymatically to yield 
the recombination products. 

-Although -in wi1d-ty pe-£.-Cfl//-homologpu^ thus 

mechanistically similar if not identical, homologous recombination in conjugational 
crosses E. coli x E. coli occurs five orders of magnitude more frequently than 

35 homeologous recombination in conjugational crosses E. coli x S. typhimurium (Matic et 
al. 1995; Cell 80, 507-515). The imbalance in favour of homologous recombination was 
shown to be caused largely by the bacterial MisMatch Repair (MMR) system since its 
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inactivation increased the frequency of homeologous recombination in £?. c&H*up to. lOppC 
fold (Rayssiguier et al. 1989, Nature 342, 396-401). 

In E. coli, the MMR system (reviewed by Modrich 1991, Annual Rev Genetics 25, 
229-253) is composed of only three proteins known as MutS, MutL and MutH. MutS 
5 reco gnizes and binds to base pair mi smatches. MutL the n forms a stab le complex with 
mismatch bound MutS. This protein complex now activates the MutH intrinsic single 
stranded endonuclease which nicks the strand containing the misplaced base and thereby 
prepares the template for DNA repair enzymes. 

During recombination, MMR components inhibit homeologous recombination. In 
io vitro experiments demonstrated that MutS in complex with MutL binds to mismatches at 
the recombination branch point and physically blocks RecA mediated strand exchange and 
heteroduplex formation (Worth et aL, 1994; PNAS 91, 3238-3241). Interestingly, the 
SOS dependent RuvAB mediated branch migration is insensitive to MutS/MutL, 
explaining the observed slight increase in SOS dependent homeologous recombination. 
15 Homeologous mating even induces the SOS response, thereby taking advantage of RuvAB 
induction (Matic et al. 1995, Cell 80, 507-515). 

The MMR system thus appears to be a genetic guardian over genome stability in E. 
coli. In this role it essentially determines the extent to which genetic isolation, that is, 
speciation, occurs. The diminished sensitivity of the SOS system to MMR, however, 
20 allows (within limits) for rapid genomic changes at times of stress, providing the means 
for fast adaptation to altered environmental conditions and thus contributing to 
intraspecies genetic variation and species evolution. 

The important role of MMR in preserving genomic integrity has been established 
also in certain eukaryotes. In its efficiency, the human MMR, for example, may even 
25 counteract potential gene therapy tools such as triple-helix forming oligonucleotides 
including RNA-DNA hybrid molecules (Havre et al., 1993, J. Virology 67: 7234-7331; 
Wang et al., 1995, Moi. Cell. Biol. 15: 1759-1768; Kotani et al., 1996, Mol. Gen. 
Genetics 250: 626-634; Cole-Strauss et al., 1996, Science 273: 1387-1389). Such 
oligonucleotides are designed to introduce single base changes into selected DNA target 
30 sequences in order to inactivate for example cancer genes or to restore their normal 
function Th o r e sulting b as e mi s mat r h p' ; hnwpypr are r ecognised by the mi smatch repair 
system—which— then-directs^removal„c^me_mismatc^ reducing the 

efficiency of oligonucleotide induced site-specific mutagenesis. 

To date, two families of related genes, homologous to the bacterial MutS and MutL 
35 genes have been identified or isolated in yeast and mammals (recent reviews by Arnheim 
and Shibata, 1997, Curr. Opinion Genet. Dev. 7, 364-370; Modrich and Lahue, 1996, 
Annual Rev. Biochem. 65, 101-133; Umar and Kunkel, 1996, Eur. J. Biochem. 238, 
297-307). Biochemical and genetic analysis indicated that eukaryotic MutS homologs 
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(MSH) and MutL homologs (MLH, PMS), respectively, fulfil similar pffltein»ftinctions }& 
their bacterial counterparts. Their relative abundance, however, could reflect different 
mismatch specificity and/or specialisation for different tissues or organelles or 
developmental processes such as mitotic versus meiotic recombination. 

j> To date, six different genes homologous to MutS have been isolated in yeast 

(yMSH), and their homologs have been found in mouse (mMSH) and human IfrMSH)™ 
respectively. Encoded proteins yMSH2, yMSH3 and yMSH6 appear to be the main MutS 
homologs involved in MMR during mitosis and meiosis in yeast, where the 
complementary proteins MSH3 and MSH6 alternatively associate with MSH2 to recognise 

10 different mismatch substrates (Masischky et al., 1996, Genes Dev. 10, 407-420). Similar 
protein interactions have been demonstrated for the human homologs hMSH2, hMSH3 
and hMSH6 (Acharya et al., 1996, PNAS 93, 13629-13634). 

MutL homologs (MLH and PMS), recently reviewed by Modrich and Lahue (1996, 
Annual Rev. Biochem. 65, 101-133) have so far been found in yeast (yMLHl and 

is yPMSl), mouse (mPMS2) and human (hMLHl, hPMSl and hPMS2). The hPMSl is a 
member of a family of at least 7 genes (Horii et al., 1994, Biochem. Biophys. Res. 
Commun. 204, 1257-1264) and its gene product is most closely related to yPMSl. Prolla 
et al. (1994, Science 265, 1091-1093) presented evidence for yPMSl and yMLHl to 
physically associate with each other and, together, to interact with the MutS homolog 

20 yMSH2 to form a ternary complex involved in mismatch substrate binding. 

However, while medical interest in mismatch repair has prompted extensive 
research on MMR in bacteria, yeast and mammals, MMR genes have not been isolated 
from higher plants prior to the present invention and no attempts to adjust the plant MMR 
to plant breeding needs have been reported. 

25 SUMMARY OF THE INVENTION 

According to a first embodiment of the invention, there is provided an isolated and 
purified DNA molecule comprising a polynucleotide sequence encoding a polypeptide 
involved in the DNA mismatch repair system of a plant. In one form of this embodiment, 
the invention provides an isolated and purified DNA molecule comprising a 
30 polynucleotide sequence encoding a polype ptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human. More particularly, the invention provides the 
coding sequences of the genes AtMSH3 and AtMSHb of Arabidopsis thaliana,'a^i\efiWCi 
hereinbelow. 

According to a second embodiment of the invention, there is provided a polypeptide 
35 functionally involved in the DNA mismatch repair system of a plant, for example a 
polypeptide which is homologous to a mismatch repair polypeptide of a yeast or of a 
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human such as a polypeptide encoded by the genes AtMSH3 or* htMSHf) <£*Arabvl£p$&l 
thaliana, as defined hereinbelow. 

According to a third embodiment of the invention, there is provided an isolated and 
purified DNA molecule comprising a polynucleotide sequence selected from the group 
5 consisting of (i) a sequence encoding a polynucleotide which is capable of interfering with 



the expression of a plant polynucleotide sequence encoding a polypeptide which is 
homologous to a mismatch repair polypeptide of a yeast or of a human and thereby 
disabling said plant polynucleotide sequence; and (ii) a sequence encoding a polypeptide 
capable of disrupting the DNA mismatch repair system of a plant. 

10 According to a fourth embodiment of the invention there is provided a chimeric 
gene comprising a DNA sequence selected from the group consisting of (i) a sequence 
encoding a polynucleotide which is capable of interfering with the expression of a plant 
polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human and thereby disabling said plant 

15 polynucleotide sequence; and (ii) a sequence encoding a polypeptide capable of disrupting 
the DNA mismatch repair system of a plant, together with regulation elements capable of 
functioning in a plant cell. Examples of such regulation elements include constitutive, 
inducible, tissue type specific and ceil type specific promoters such as 35S, NOS, PRla, 
AoPRl and DMCL 

20 In the third and fourth embodiments, said interference, by said polynucleotide 
sequence, with the expression of a plant polynucleotide sequence encoding a polypeptide 
which is homologous to a mismatch repair peptide of a yeast or a human typically occurs 
by hybridisation or by co-suppression. 

According to a fifth embodiment of the invention there is provided a plasmid or 
25 vector comprising a chimeric gene of the fourth embodiment. A vector of the fifth 
embodiment may be, for example, a viral vector. 

According to a sixth embodiment of the invention, there is provided a plant cell 
transformed, transfected or electroporated with a plasmid or vector of the fifth 
embodiment. 

30 According to seventh embodiment of the invention, there is provided a plant 



comprising a cell of the sixth embodiment. 



According to an eighth embodiment of the invention, there is provided a process for 
at least partially inactivating a DNA mismatch repair system of a plant cell, comprising 
transforming or transfecting said plant cell with a DNA sequence of the third embodiment 
35 or a chimeric gene of the fourth embodiment or a plasmid or vector of the fifth 
embodiment, and causing said DNA sequence to express said polynucleotide or said 
polypeptide. 
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According to a ninth embodiment of the invention, there is*provitied*& process fpft 



increasing genetic variation in a plant comprising obtaining a hybrid plant from a first 
plant and a second plant, or cells thereof, said first and second plants being genetically 
different; altering the mismatch repair system in said hybrid plant; permitting said hybrid 
5 plant to self-fertilise and produce offspring plants; and screening said offspring plants for 
^antsnrT^hicTT^ 

recombination may be evidenced by new genetic linkage of a desired characteristic trait or 
of a 2ene which contributes to a desired characteristic trait. 



According to a tenth embodiment of the invention there is provided a process for 
10 obtaining a plant having a desired characteristic, comprising altering the mismatch repair 
system in a plant, cell or plurality of cells of a plant which does not have said desired 
characteristic, permitting mutations to persist in said cells to produce mutated plant cells, 
deriving plants from said mutated plant cells, and screening said plants for a plant having 
said desired characteristic. 

is In a preferred form of the ninth and tenth embodiments of the invention, the step of 
altering the mismatch repair system comprises introducing into said hybrid plant, plant, 
cell or cells a chimeric gene of the fourth embodiment and permitting the chimeric gene to 
express a polynucleotide which is capable of interfering with the expression of a plant 
polynucleotide sequence in a mismatch repair gene of the hybrid plant, plant, cell or cells, 

20 or a polypeptide capable of disrupting the DNA mismatch repair system of the hybrid 
plant or cells. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides a diagrammatic representation of the primer sequences used to 
isolate AtMSH3. 

25 Figure 2 is a plasmid map of clone 52, showing restriction enzyme cleavage sites in 
the 5* half of the full-length cDNA for AtMSH3. 

Figure 3 is a plasmid map of clone 13, showing restriction enzyme cleavage sites in 
the 3' half of the full-length cDNA for AtMSH3. 

Figure 4 is a sequence listing of the coding sequence of AtMSH3, together with a 
30 deduced sequence of the encoded polypeptide. 



Figure 5 is a protein alignment ot yeast {saccnaromyceS cerevisiae ) and Arubidupsis 

" ~i}iaUanuMSR3 "pr uleiflT" " — =.= 

Figure 6 provides a diagrammatic representation of the primer sequences used to 
isolate AtMSH6. 

35 Figure 7 is a plasmid map of clone 43, showing restriction enzyme cleavage sites in 
the 5' half of the full-length cDNA for AtMSH6. 

Figure 8 is a plasmid map of clone 62, showing restriction enzyme cleavage sites in 
the 3* half of the full-length cDNA for AtMSH6. 
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Figure 9 is a sequence listing of the coding sequence of AtMSl)6i *tei^ther ^wlth*$; 
deduced sequence of the encoded polypeptide. 

Figure 10 is a protein alignment of yeast (Saccharomyces cerevisiae) and 
Arabidopsis thaliana MSH6 protein. 

Figure 1 1 is a genomic sequence listing of AtMSH6. 
" Figul^rc^is~aT513slm^ — * 1 — 



Figure 13 is a plasmid map of plasmid pPF14. 
Figure 14 is a plasmid map of plasmid pCW186. 
Figure 15 is a plasmid map of plasmid pCW187. 
10 Figure 16 is a diagrammatic representation of an antisense gene construction for use 
in homeologous meiotic recombination. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is based on the inventors' discovery that there exist in higher 
plants genes which are homologous to MMR genes in E. coli, and to MMR genes in 
15 yeasts and humans. 

Thus, the inventors have identified genes, herein designated AtMSH3 and AtMSH6, 
of the plant Arabidopsis thaliana which encode the proteins AtMSH3 and AtMSH6. 
These plant proteins are homologous to yMSH3 and yMSH6, respectively. The present 
inventors have isolated cDNAs encoding the proteins AtMSH3 and AtMSH6 and have 

20 isolated the complete gene encoding AtMSH6. Given the teaching herein, other genes 
may be obtained (including genes of other plants) which are involved in DNA mismatch 
repair in plants, including other genes which encode polypeptides homologous to MMR 
proteins of yeasts or humans. For example, given the teaching herein, genes of members 
of the Brassicaceae family or of other unrelated families, for example the Poaceae, the 

25 Solanaceae, the Asteraceae, the Malvaceae, the Fabaceae and the Cucurbitaceae family, 
and which encode polypeptides homologous to MMR proteins of yeasts or humans may be 
obtained. 

A procedure which may be followed to obtain genes AtMSH3 and AtMSH6 is 
described in Example 1. Essentially the same technique may be applied to cDNA 
30 obtained by reverse transcription of RNA from other plants. Alternatively, given the 
sequence lnlormation disclosed herein, other degenerate oligonucle otide pinners may b e- 
d te signcd~and~obtaincd for U3C in-isolating-sequences ofplant-mkmateh-repair-g^nes-whiPh- 



are homologous to AtMSH3 or AtMSH6, from other plants. 

The successful gene isolation disclosed herein demonstrates for the first time the 
35 existence of MMR in higher plants and indicates the presence of other plant MMR genes. 
For example, genes encoding the plant homologs of MSH1, MSH2, MSH4, MSH5, 
PMS1, PMS2 and MLH1 may be identified given the teaching herein. Such genes, as 
well as those specifically described herein, separately or in combination, are useful in 
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manipulating the plant MMR for plant breeding purposes. Thus* "for example, thejplajjtf 
MMR may be altered by including in a plant cell a polynucleotide sequence as defined 
herein above with reference to the third embodiment of the invention, and causing the 
polynucleotide sequence to express either a polynucleotide which disables a plant MMR 
5 gene, or a polypeptide which disrupts the plant's MMR system. 

The DNA molecule of the third embodiment of the invention includes a 
polynucleotide sequence (herein referred to as a MMR altering gene) which may for 
example encode sense, antisense or ribozyme molecules characterised by sufficient base 
sequence similarity or complementarity to the gene to be altered to permit the antisense or 
10 ribozyme molecule to hybridise with the plant MMR gene in vivo or to permit the sense 
molecule to participate in co-suppression. Alternatively, the MMR altering gene may 
encode a protein or proteins which interfere with the activity of a plant MMR protein and 
thus disrupt the plant's MMR system. For example, such encoded proteins may be 
antibodies or other proteins capable of interfering with MMR protein function. 

15 A chimeric gene of the fourth embodiment, incorporating a MMR altering gene, 
may be prepared by methods which are known in the art. Similarly, the MMR altering 
gene may be introduced into a plant cell, regenerating tissue or whole plant by techniques 
known in the art as being suitable for plant transformation, or by crossing. Known 
transformation techniques include Agrobacterium tumefaciens or A. rhizogenes mediated 

20 gene transfer, ballistic and chemical methods, and electroporation of protoplasts. 

The MMR altering gene or genes are typically expressed from suitable promoters. 
Suitable promoters may direct constitutive expression, such as the 35S or the NOS 
promoter. Usually, however, the promoter will direct either inducible or tissue specific 
(e.g. callus; embryonic tissue; etc.), cell type specific (e.g. protoplasts; meiocytes; etc.) 

25 or developmental (e.g. embryo) expression of the altering gene or genes, in order for the 
MMR system to function in tissue types or cell types, or at developmental stages of the 
plant, in which it is not desirable for the MMR system to be altered. Using such 
promoters, therefore, the activity of a MMR altering gene may be limited to a specific 
stage during plant development or it may be altered by controlling conditions external to 

30 the plant, and the deleterious effects of a permanently disabled or altered DNA mismatch 
repair system in a plant may be avoided. Examples of suitabl e promoters which are not 

constitutive are known in the art and i nclude inducible promoters such as PR1& (reviewed 

by Gatz, 1997, Annual Rev. Plant Phys. Plant Mol. Biol. 48: 89), tissue specific 
promoters such as AoPRl (Sabahattin et al., 1993, Biotechnology 11: 218), and cell-type 

35 specific promoters such as DMC\. 

A chimaeric gene in accordance with the invention may further be physically linked 
to one or more selection markers such as genes which confer phenotypic traits such as 
herbicide resistance, antibiotic resistance or disease resistance, or which confer some 
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: fertility, era in size! colour, gr©w!f\; 



other recognisable trait such as male sterility, male fertility, 
rate, flowering time, ripening time, etc. 

The process of the tenth embodiment of the invention provides, for example, a 
process for generating intraspecies genetic variation by altering the mismatch repair 
s system in a plant cell, in regenerating plant tissue or in a whole plant. The plant cell, 



regenerating tissue or whole plant includes and expresses one or more MMR altering 
genes which are capable of altering mismatch repair in the plant cell, regenerating tissue 
or whole plant. Alteration of MMR may be achieved, for example, by inactivating the 
genes encoding plant MSH3 and/or plant MSH6. It is preferred to inactivate the plant 

10 MSH3 and MSH6 encoding genes at the same time and in the same plant cell, 
regenerating tissue or whole plant. Typically in this preferred form of the invention 
inactivation of either plant MSH3 or MSH6 alone is insufficient to substantially alter the 
plant's mismatch repair system and only when both MSH3 and MSH6 are inactivated 
simultaneously is the plant's mismatch repair system sufficiently altered to prevent the 

is MMR system from recognising base pair mismatches as a result of DNA replication 
errors, DNA damage, or oligonucleotide induced site-specific mutagenesis. However, in 
some applications of the invention, inactivation of only one gene may also be used to 
cause genomic instability or increase the efficiency of site-specific mutagenesis. 

If desired, the MMR altering gene or genes may later be rendered non-functional or 
20 ineffective, or may be removed from the genome of the plant cell, regenerating tissue or 
whole plant in order to restore mismatch repair in the plant cell, regenerating tissue or 
whole plant. The MMR altering gene or genes may be inactivated by means of known 
gene inactivation tools, such as ribozymes, or may be removed from the genome using 
gene elimination systems known in the art, such as CRE/LOX, It is preferred to render 
25 two genes, whose gene products combine to incapacitate MMR, ineffective by separating 
the altering genes through segregation. Therefore, in a preferred embodiment of the 
invention a first plant cell or plant is generated in which only plant MSH3 is incapacitated, 
and a second plant cell or plant is generated in which only plant MSH6 is incapacitated. 
The combination of both genomes, for example by crossing, then produces significant 
30 MMR deficiency in those cells or plants which have inherited both altering genes. If the 
altering genes are expressed from unlinked loci, gene segregation restores MMR activity 
in the progeny of the cells or plants. " " 



In a process of the ninth embodiment of this invention, homeoiogous recombination 
is enhanced between different genomes, chromosomes or genes in plant cells or plants by 
35 altering MMR in said plant cells or plants. Such genomes, chromosomes or genes are 
characterised in that they originate from different plant families, genera, species, 
subspecies, plant varieties or lines. Hybrid plant cells or hybrid plants may be produced 
by crossing, by cell fusion or by other techniques known in the art. These plant cells or 
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plants are further characterised by expressing one or more genes that *a&,capa$e 6ft 
altering mismatch repair in the plant cell or plants. 

In the process of the ninth embodiment, the homeologous recombination is typically 
for the purpose of introducing a desired characteristic in the hybrid plant. In this typicl 
^ap plica ti on of the process of the ninth embodiment, and iln the process of the tenth 
embodiment the desired characteristic may be any characteristic which is~of valueTo~ffie~ 
plant breeder. Examples of such characteristics are well known in the art and include 
altered composition or quality of leaf or seed derived storage products (e.g. oil, starch, 
protein), altered composition or quality of cell walls (e.g. decrease in lignin content), 
10 altered growth rate, prolonged flowering, increased plant yield or grain yield, altered 
plant morphology, tolerance to or improved performance under environmental stresses of 
various kinds, etc. 

In a preferred form of the tenth embodiment, an MMR altering gene is co- 
introduced along with the homeologous genome, chromosome or gene of another plant 
15 cell or plant into an MMR proficient plant cell or MMR proficient plant to produce a 
hybrid plant cell or hybrid plant in which homeologous recombination can occur. 
Suitably, the MMR proficient plant cell or MMR proficient plant may also include an 
MMR altering gene. For example a gene capable of inactivating plant MSH3 may be co- 
introduced along with the homeologous genome, chromosome or gene of another plant 
20 cell or plant into an MMR proficient plant cell or MMR proficient plant in which MSH6 
is inactivated. A resultant hybrid plant in which homeologous recombination occurs will 
include both the MSH3 and MSH6 altering genes and its MMR system will therefore be 
inactivated. 

In this form of the invention, if hybrid plants are to be produced by crossing, the 
25 MMR altering gene preferably originates from the male parent, thus ensuring that the 
MMR altering gene is always introduced and is not present in the recipient cell. That is, 
the MMR of the recipient cell, prior to introduction of the MMR altering gene, is 
typically proficient. Alternatively, if an MMR altering gene is present in a recipient cell 
it may be ineffective or inefficient on its own, or it may be linked to an inducible or tissue 
30 specific or cell type specific promoter which only renders the MMR altering gene active 
under limited conditions. 

Thus._in-a-prefeTOd„form^^ of t he ninth embodiment, the MMR s ystem 

of the hybrid plant is initially unaltered. In this form of the process, the step of altering 
the mismatch repair system may comprise introducing into the hybrid plant, or cells 
35 thereof, a MMR altering gene, such as by Agrobacteriwn tumefaciens or A. rhizogenes 
mediated gene transfer, ballistic and chemical methods, and electroporation of protoplasts. 

The MMR altering gene or genes are typically expressed from suitable promoters, 
as described above. Preferably, the promoter is transcriptionally active in mitotically and 
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meiotically active tissue and/or cells to ensure MMR alteration after chromosome pairing 
at mitosis and meiosis, respectively. The preferred timing for MMR alteration is at 
meiosis, because recombinant genomes, chromosomes or genes are directly transmitted to 
the progeny. A suitable meiocyte specific promoter is for example the DMCl promoter 
5 from Arabidopsis thaliana ssp. L.er. (Klimyuk and Jones, 1997, Plant J. 11, 1-14). 
- — Howe ver—mi totie— faomeolo gous— reeomb inatien— i s— als o— a— de sirab le— outcome— as— s omati& 
recombination events can be transmitted to offspring due to the totipotency of plant cells 
and the lack of predetermined germ cells in plants. 

If desired, the MMR altering gene or genes may later be rendered non-functional or 
10 ineffective, or may be removed from the hybrid plant or hybrid plant cells, in order to 
restore mismatch repair in the hybrid plant or hybrid plant cells. The MMR altering gene 
or genes may be inactivated by means of known gene inactivation tools as described 
herein above. 

EXAMPLES 

is Example 1. Cloning of the AtMSH3 and AtMSH6 coding sequences 

Isolation of partial AtMSH3 and AtMSH6 consensus sequences 

Degenerate oligonucleotides UPMU and DOMU 

UPMU CTGGATCCACIGGICCIAA(C/T)ATG 
DOMU CTGGATCC(A/G)TA(A/G)TGIGTI(A/G)C(A/G)AA 
20 were used to isolate AtMSH3 and AtMSH6 sequences by PCR amplification. 

Primers UPMU and DOMU correspond to conserved amino acid sequences of the 
proteins MutS (E. coli and S. typhimurium), Hex A (5. pneumoniae), Repl (mouse) and 
Duel (human). The conserved regions to which they are targeted are TGPNM for 
UPMU (amino acid positions 852-856 for AtMSH6 and 817-821 for AtMSH3) and 
25 FATHY for DOMU (amino acid positions 965-969 for AtMSH6 and 929-933 for 
AtMSH3.) These primers have been used to isolate MSH2 and MSH1 from yeast 
(Reenan and Kolodner, Genetics 132: 963-973 (1992)) and MSH2 from Xenopus and 
mouse (Varlet et aL, Nuc, Acids Res. 22:5723-5728 (1994)). 

Template single strand cDNA was produced by reverse transcription of 2 /xg total 
30 RNA from a cell suspension culture of Arabidopsis thaliana ecotype Columbia (Axelos et 
al. 1989, Mol. Gen. Genetics 219: 106-112). The PCR reaction was performed under the 
fcTlowing conditionl^irriri Qria^ 

1XPCR buffer, lu Tag DNA polymerase (Appligene) in the presence of template cDNA. 
PCR parameters were 5 minutes at 94°C, followed by 30 cycles of 40 seconds at 95 °C, 
35 90 seconds at 45 °C, 1 minute at 72 °C. The amplification products were cloned into 
pGEM-T vector (Promega) and sequenced. Two different clones were isolated, S5 
(350bp) was homologous to MSH3, S8 (327bp) was homologous to MSH6. Complete 
cDNA sequences were then isolated according to the Marathon cDNA amplification kit 
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procedure (Clontech). In summary, this procedure involves producing double stranded 
cDNA by reverse transcription of 2ptg polyA+ RNA from the cell suspension culture of 
Arabidopsis. Adaptors are ligated on each side of the cDNA. The ligated cDNA is used 
as a template for 5' and 3' RACE PCR reactions in the presence of primers that are 
specific for the adaptor on one side (API and AP2), and specific for the targeted gene on 
TKe^TheTiiBe^T^ 

complete gene coding sequence can be reconstituted taking advantage of a unique 
restriction site, if available, in the overlapping region. Specific details of this procedure 
as it was used to isolate AtMSHB and AtMSH6 coding regions, are as follows. 

Isolation of AtMSH 3 complete coding sequence 

From the sequence of clone S5, primer 636 was designed: 
636 TGCT AGTGCCTCTTGC AAGCTC AT . 

Primer API is complementary to a portion of an adaptor sequence which had been 
ligated to the 5' and 3' ends of Arabidopsis cDNA: 

API CC ATCCTAATACGACTC ACT ATAGGGC. 

PCR performed on the ligated cDNA with primers 636 and API for the 5' RACE 
PCR was followed by a second round of amplification with the nested primers AP2 and 
S525 

AP2 ACTCACTATAGGGCTCGAGCGGC 

S525 AGGTTCTGATTATGTGTGACGCTTTACTTA 
(the latter was also designed to correspond to a part of the sequence of clone S5) and 
produced a 2720bp DNA fragment. Figure 1 provides a diagrammatic representation of 
the primer sequences used to isolate AtMSH3. Another primer (S51) 

S5 1 GGATCGGGTACTGGGTTTTGAGTGTGAGG 
was designed closer to the 5' border and permitted the determination of 99bp upstream to 
the ATG initiation codon. For the 3' RACE PCR, a first PCR reaction was performed 
with primers API and 635, 

635 GCACGTGCTTGATGGTGTTTTCAC 
followed by a second round of amplification, using the nested primers AP2 and S523 

S523 TCAGACAGTATCCAGCATGGCAGAAGTA 
which produced a DNA fragment of 890bp. Both DNA fragments were subcloned into 
pGEM-T and sequenced. Since Pl,R amplification using the Expand L u ng Template PCR 

were designed to isolate those sequences again by PCR, but with the high fidelity DNA 

polymerase Pfu. PCR with primers 1S5 and S53 

1 S5 ATCCCGGG ATGGGC AAGC AA A AGC AGC AG ACG A 

S53 GACAAAGAGCGAAATGAGGCCCCTTGG 

amplified the 1244bp fragment clone 52 (cloned into pUC18/Smal). PCR with primers 

S52 and 2S5 
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13 

2S5 atcccgggtcaaaatgaXc^agttggttttagtc 

S52 GCCACATCTGACTGTTCAAGCCCTCGC 
amplified the 2104bp clone 13 (cloned into pUC18/Smal). These two clones were iigated 
after digestion by the BarnHl restriction enzyme for which a unique site is present in their 
5 overlapping region. The remaining primers referred to in Figure 1 are as follows: 

S51 GGvW6GGGT^€TGGGTTTTGAGTGT-GAGG— — — _ 

S525 AGGTTCTGATTATGTGTGACGCTTTACTTA 
Figures 2 and 3 provide plasmid maps of clones 52 and 13 respectively, showing 
restriction enzyme cleavage sites. The complete AtMSH3 coding sequence is 3246bp long 
10 and is shown in Figure 4 together with the deduced sequence of the encoded polypeptide. 
AtMSH3 is clearly homologous to the yeast and mouse MSH3 genes. A sequence 
alignment of polypeptides encoded by AtMSH3 and that encoded by Saccharomyces 
cerevisiae MSH3 is set out in Figure 5 . 

Isolation of the AtMSH6 complete coding sequence and genomic sequences 

15 The same procedure allowed isolation of the AtMSH6 cDNA. Figure 6 provides a 
diagrammatic representation of the primer sequences used to isolate AtMSH6. For the 5' 
RACE PCR, primers 638 and API 

638 TCTCTACCAGGTGACGAAAAACCG 
allowed the amplification of a 2889 DNA fragment. Primer S81 
20 S8 1 CGTCGCCTTTAGC ATCCCCTTCCTTC AC 

helped define the 142bp upstream to the ATG initiation codon. On the 3' side, RACE 
PCR was initially performed with primers S823 and API, 

S823 GCTTGGCGCATCTAATAGAATCATGACAGG 
and then with the nested primers 637 and AP2, 
25 637 GACAGCGTCAGTTCTTCAGAATGC 

to produce a 774bp DNA fragment. As for AtMSH3, those fragments were cloned and 
sequenced. Re-isolation of the DNA sequence using the high fidelity Pfu polymerase and 
newly designed primers 1S8 and S83 (for the 5' side) led to a 2182 bp DNA fragment 
identified as clone 43 (cloned in pUC18/Smal), and a 1379bp clone identified as clone 62 
30 (also cloned in pUC18/Smal). 

1S8 ATCCCGGGATGCAGCGCCAGAGATCGATTTTGT 
2S8 ATCCCGGGTTATTTGGGAACACAGTAAGAGGATT 



S83 CGCTATCTATGGCTGCTTCGAATTGAG 
35 Figures 7 and 8 provide plasmid maps of clones 43 and 62 respectively, showing 
restriction enzyme cleavage sites. Clones 43 and 62 were digested by the Xmnl 
restriction enzyme for which a unique site is present in their overlapping region and then 
Iigated. The complete AtMSH6 coding sequence is 3330bp long and is shown in Figure 9 
together with the deduced sequence of the encoded polypeptide. AtMSH6 is clearly 
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homologous to the yeast and mouse M£//6genes. A s'equence alignment o"f" polypeptides 
encoded by AtMSH6 and that encoded by Saccharomyces cerevisiae MSH6 is set out in 
Figure 10. 

An AtMSH6 genomic sequence was also isolated from a genomic DNA library 
constituted after partial Sau3Al digestion of DNA from the Arabidopsis cell suspension. 
8062r5p^efe~sequ^^^ 

cDNA. 16 introns are found scattered along the gene. The complete genomic sequence is 
shown in Figure 1 1 . 

Example 2. A measure of somatic variation in MMR deficient plants 

Constructs 

Constructs with antisense AtMSH3 or antisense AtMSH6 or both AtMSH3/AtMSH6 
under the control of a single 35S promoter have been inserted into the binary vector 
pPZP121 between the right and left borders of the T-DNA. The pPZP121 plasmid 
confers chloramphenicol resistance to Escherichia coli or Agrobacterium tumefaciens 
bacteria. The aacCl gene is carried by the T-DNA and allows selection of transformed 
plant cells on gentamycine (Hajdukiewicz et al., Plant Mol. Biol. 25, 989-994). For the 
purpose of expressing antisense constructs, a 35S promoter/terminator cassette with a 
polylinker was introduced into pPZP121. The 3' ends of the considered genes have been 
chosen since this region seems more efficient for antisense inhibition. For AtMSH3 this 
corresponds to clone 13 (2104bp), for AtMSH6 this corresponds to clone 62 (1379bp). 
Clone 13 comprises 2104bp of the 3* region that were cut off the pUC18 vector by 
Sall/Sstl restriction, blunted with T4 DNA polymerase and ligated into the T4 DNA 
polymerase blunted BamHl site of pPZP121/35S, creating clone pPF13. The same 
procedure was followed for the 3' region of AtMH6 clone 62 (1379bp) thus creating 
plasmid pPF14. For the double constructs, the 3' region (from clone 62) of AtMSH6 was 
introduced ahead of the AtMSH3 region into pPF13 creating pCW186 and reciprocally, 
the 3' region of AtMSH3 (from clone 13) was introduced ahead of AtMSH6 into pPF14, 
creating pCW187. Figures 12-15 provide plasmid maps of plasmids pPF13, pPF14, pCW 
186 and pCW187, respectively. 

Plant cell transformation 

lant cell s 



atc~ men~transfo m i e d by-co cultivation A-™sppmion-caittiirp.- ^ ^ 

cells that has been established by Axelos et al. (1992, Plant Physiol. Biochem. 30, 1-6) 
may be used. One day old freshly subcultured cells are diluted five times into AT 
medium (Gamborg B5 medium, 30g/l sucrose, 200/Ltg/l NAA). 10/xl of saturated 
Agrobacterium containing the transforming T-DNA constructs are added to 10ml diluted 
cells in a 100ml erlenmeyer. The co-cultivation is agitated slowly (80rpm) for 2 days. 
The cells are then washed 3 times into AT medium and finally resuspended in the same 
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initial volume (10ml). The culture is agitated for t days* to* "allow* expression before 
plating on selection plates (AT/BactoAgar 8g/l+gentamycine 5(Vg/ml). Transformed 
individual calli are isolated 3 weeks later. 

Microsatellite analysis 

5 , Microsatellit es have been described in Arabidopsis (Bell and Ecker, 1994, Genomics 

19, 137-144). We chose to study instability of microsatellites that are polymorphic 
amongst different ecotypes. DNA is extracted from the transformed calli. Specific 
primers have been defined that are used to amplify the microsatellite sequence. One of 
the two primers is previously P 32 labelled by T4 kinase. In case of a polymorphic 

10 variation, new PCR products appear that do not follow the expected pattern of migration 
on a polyacrylamide gel. This is a commonly observed feature for MMR deficient cells 
in yeast or mammalian cells. 

Forward mutagenesis assay 

A single copy of the codA gene has been introduced in the cells prior to 
is transformation by the antisense constructs. This codA gene renders the cells sensitive to 
the base analog 5-fluoro-cytosine (Perera et al., 1993, Plant Mol. Biol. 23, 793-799). 
Inactivation of MMR is expected to increase mutagenesis. Newly arising mutations will 
eventually inactivate the codA gene conferring resistance to 5 fluoro-cytosine. The codA 
gene from resistant clones will be isolated by PCR and sequenced to check for the 
20 occurrence of mutations in its coding sequence. 

Example 3. Homeologous meiotic recombination in Arabidopsis thaliana 

A. Construction of a meiocyte specific gene expression cassette comprising the DMCl 
promoter and the NOS terminator 

A 1.8 kb DNA fragment comprising the 3' terminal part of the meiocyte specific 
25 DMCl promoter (Klimyuk and Jones, 1997, Plant J. 11, 1-14) was isolated by PCR from 
purified genomic DNA of Arabidopsis thaliana (ssp, Landsber erecta "Ler"). The 
forward PCR primer (DMCl a) contained the sequence 

acgcgtcgacGAATTCGCAAGTGGGG 
and introduced a Sail cloning site at the 5' end of the promoter fragment. The reverse 

™ PPtt primer (DMCIh) con tained the sequence 

t c^tggagajctc ccgggtacCGATTTGCTTCGAGGG 
introducing a poly linker region at the 3' end of the promoter fragment. The PCR reaction 
was carried out with VENT DNA Polymerase (NEB) over 25 cycles using the following 
cycling protocol: 1 minute at 94°C, 1 minute at 54°C, 2 minutes at 72°C. 

35 The PCR reaction yielded a blunt ended DNA fragment which was digested with 
restriction enzyme Sail and was cloned into the cleavage sites of restriction enzymes Sail 
and Smal in plasmid p2030, a pUC118 derivative containing the SacI-EcoRl NOS 
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terminator fragment of pBIN19. The cloning yielded pfasmid p205*l/ containing the 
DMCl promoter-polylinker-NOS terminator expression cassette depicted in Figure 16. 

Construction of an MSH3 antisense gene under the control of the DMCl promoter 

A 2.1 kb DNA fragment encoding the carboxy terminal part of AtMSH3 was 
^5-removed- fromahe^oJyiirikei^^ with 
Kpn\, (ii) blunting of the DNA ends generated by Kpnl and (iii) digestion with BamBl. 
The isolated fragment was then cloned in antisense orientation downstream of the DMCl 
promoter in plasmid p2031, which had been digested with restriction enzymes Smal and 
Bglll. This cloning yielded plasmid p2033 (Figure 16). 

10 After digestion of p2033 with EcoRl, a 4.1 kb DNA fragment was recovered 
comprising the DMCl promoter, the partial MSH3 cDNA in antisense orientation with 
respect to the promoter and the NOS terminator. This fragment was cloned into the 
EcoRl restriction site of plant transformation vector pNOS-Hyg-SCV to yield plasmid 
p3242 (Figure 16). 

15 C. Construction of a combined MSH3/MSH6 antisense gene under the control of a 
single DMCl promoter 

A 1 kb DNA fragment encoding the carboxyterminal part of ATMSH6 is isolated 
from clone 62 described in Example 1 after digestion of clone 62 plasmid DNA with 
BamHl, which cleaves in the 5' polylinker region flanking the partial cDNA, and with 

20 EcoKL, which cleaves within the cDNA. The isolated fragment is treated with Klenow 
enzyme to blunt both its ends and is cloned into the recipient plasmid p2033 or p3242. 
For the purpose of cloning, the recipient plasmid may be cleaved with either Aval or Ncol 
and can be blunted with Klenow enzyme to produce blunt acceptor ends for fragment 
cloning. This cloning yields two opposite orientations of cloned fragment DNA with 

25 respect to the DMCl promoter. These can be identified by diagnostic digestion with 
restriction enzymes Seal or Xmnl in conjunction with Sad. The selected construct 
contains the DMCl promoter, the combined partial cDNAs for AtMSH3 and AtMSHS 
(both cloned in antisense orientation with respect to the DMCl promoter) and the NOS 
terminator. If the recipient plasmid is p2033, the combined antisense gene under control 

30 the single DMCl promoter is recovered from the vector after EcoBl digestion and is 
cloned into the hcoRl restrictio n site of pNOS-Hyg-SCV. 



D. Sele ction of Recombination markers on nomeologous chromosomes ot^tn 
maliana subspecies Landsbers erecta (Ler\ Columbia (CoW and C24. respectively 

Pfi), Visual recombination markers in Arabidopsis th. subspecies C24: 



35 Agrobacterium mediated transformation with a T-DNA containing a 35S-GUS gene, 
inactivated by insertion of a 35S-Ac transposable element (Finnegan et al., 1993, Plant 
Mol. Biol. 22, 625-633), had yielded a C24 line in which the T-DNA construct was 
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integrated into chromosome 2. Genetic and molecular "analysis of this line* snows tnat tne 
Ac transposon had excised from its T-DNA locus thereby restoring GUS activity, but had 
re-inserted into the chromosome at a distance of 16.4 cM, where it stayed fixed (due to 
disablement of Ac) within the chlorina gene. Insertional inactivation of the chlorina gene 
5 caused a bleached phenotype in those plants that were homozygous for this mutation. 
Because oYlh£"tw 

used in crosses to wildtype Ler for analysis of meiotic homeologous recombination on 
chromosome 2 in conjunction with molecular recombination markers. 

DCin. Visual recombination markers in Arabidopsis th. Ler. 

io The Ler line NW1 (obtained from NASC, Nottingham, UK) contains one recessive 
visual marker per chromosome, i.e. an-1 on Chr.l, py-1 on Chr.2, gll-1 on Chr.3, cer2- 
1 on Chr.4, and msl-1 on Chr.5. This line was used in crosses to wildtype C24 for 
analysis of meiotic homeologous recombination on chromosomes 1-5 in conjunction with 
molecular recombination markers. 

15 Driii^ Molecular recombination markers in Col. Ler and C24: 

The genome of Arabidopsis thaliana is interspersed with unique base sequences 
arranged as simple tandem repeats. Allelic repeats can vary in length between different 
Arabidopsis subspecies and when amplified by PCR yield diagnostic DNA products of 
different length named Simple Sequence Length Polymorphisms (SSLPs). Many SSLPs 
20 have been genetically mapped and have been assigned to unique chromosome locations on 
the recombinant inbred map (Bell and Ecker, 1994, Genomics 19, 137-144; Lister and 
Deans lines, Weeds World 4i, May 1997). 

In Table 1 are listed 28 mapped and established SSLPs between Ler and Col. A 
number of PCR primer pairs are described herein which also yielded SSLPs between C24 
25 and Ler (19 SSLPs) or between C24 and Col (25 SSLPs), respectively. Polymorphic 
SSLPs can be used as molecular markers in the analysis of homeologous recombination 
between genomes from these subspecies. 

The PCR reactions (25 \iL) were carried out over 35 cycles (15 seconds at 94°C, 30 
seconds at 55°C and 30 seconds at 72°C), with 0.25 U Taq DNA polymerase and 0.6 fig 

30 genomic DNA in reaction buf fer containing 2 mM MgCl2- PCR products were separated 
by agarose gel electrophoresis (4% ultra high resolution agarose) and visualised by 
ethidiumbromide staining. The results Irom tne FUR experiments are SUimttaris^~hT 
Table 1, which also shows the sequence of PCR primers, primer annealing temperature, 
and chromosome location of SSLP (with respect to the RI map of May 1997, Weeds 

35 World 4i). 
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E. Production of hybrid plants 



C24 plants heterozygous for chlorina3A:AcIGUS were crossed as male to 
emasculated wildtype Ler to produce Ler / C24(chlorina3A, GUS) hybrid seeds. 

Ler plants homozygous for the five chromosome markers were male sterile (msl-1) 
^5 anri-were-cmftseri without erna sculationJQjwildtvpe C24 to produce Ler(an-1, py-1, 
cerl-l, msl-l)/C24 hybrid seeds. 

F. Introduction of MSH3 and MSH316 antisense genes into hybrid plants and analysis 
of meiotic homeologous recombination 

The plant transformation vectors comprising the antisense genes described in (B) 
10 and (C) above are introduced into Agrobacterium tumefaciens strain AGLl (Lazo et al., 
1991, Bio/Technology 9, 963-967) by electroporation. Recombinant Agrobacterium 
clones are selected on LB medium containing 50 mg/L rifampicin and 50 mg/L 
kanamycin. Selected clones are used to infect roots of Arabidopsis hybrid plants 
(described in (E) above) using the root transformation protocol of Valvekens et al. (1988, 
15 PNAS 85, 5536-5540) except that shoot and root inducing media contain hygromycin (10 
mg/L) instead of kanamycin. 

Plants regenerated from roots of hybrid plants are genetic clones of root donating 
plants and therefore are again genetic hybrids of two Arabidopsis subspecies described in 
(E). However, in contrast to the root donating plants, the regenerated hybrid plants also 
20 contain the introduced transgene and the co-introduced hygromycin resistance gene with 
the latter allowing these plants to grow on hygromycin containing culture medium. 

Hygromycin resistant plants are then allowed to enter the reproductive phase and to 
produce gametes by meiotic divisions of microspore and megaspore mothercells. At 
meiosis. the DMC1 promoter is activated and can direct the expression of antisense genes 
25 described in (B) and (C) above, leading to decreased MSH6 and/or MSH3 gene 
expression. This in turn depletes the gamete mothercells of MSH6 and/or MSH3 protein, 
thus causing alteration of MMR during meiotic divisions and increasing the recombination 
frequency between homeologous chromosomes. 

Transgenic plants are then allowed to self-fertilise and to produce seeds. These 
m <=eeH<? (F9 gppHs with respect to hybrid production), and the plants derived therefrom, 
carry the .homeologous recombination events which can be identified b y using the visual 



and molecular recombination markers described in (D) above. 

In case of homeologous recombination between chromosomes of Ler and 
C24(chlorina3A:Ac, GUS), the analysis concentrates on chromosome 2 by selecting plants 
35 showing the visual phenotypic marker chlorina. This marker thus serves as a reference 
point as it indicates that respective chromosomes 2 originate from C24. Other markers, 
such as GUS or molecular markers, on chromosome 2 may then be used to identify 
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chromosomal regions which are derived from the *Ler "chromosome asT'a res*ult*of 
homeologous recombination. F2 plants of control trans formants not expressing the 
antisense gene(s) can be analysed in parallel and the results can be used for comparison to 
homeologous recombination results obtained in antisense plants, 

5 In case of homeologous recombination between chromosomes of C24 and Ler (an-1, 
py-1, gll-1, cer2-l, msl-1), the analysis is similar to that described above, except that the 
presence of a visual marker on each chromosome facilitates the study of homeologous 
recombination on all five chromosomes. 
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Figure 2 



Comments/References: 52= 3' side of 35 (AtMSHS) 1244bp in pUCl8/Sma1 




Comments/References: 13 = 3* sioc- of 55 (Ali/iSH.2} 21C4bp in pUCl8/Sma1 
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1 CCTAAGAAAGCGCGCGAAAATTGGCAACCCAAGTTCGCCATAGCCACGACCACGACCTTCCATTTCTCTTAAACGGAGGA 8 0 

SI G AT T AC G AAT AAAGC AATT ATG GGC AAG CAA AAG CAG CAG ACG ATT TCT CGT TTC TTC GCT CCC 144 
1 MGKQKQQTIS?. FFAP15 

14 5 AAA CCC AAA TCC CCG ACT CAC GAA CCG AAT CCG GTA GCC GAA TCA TCA ACA CCG CCA CCG 204 

205 AAG ATA TCC GCC ACT GTA TCC TTC TCT CCT TCC AAC CGT AAG CTT CTC TCC GAC CAC CTC 2 64 

36 K I S A T V S F S ? S K R K L L £ D H L 55 

2 5= GCC GCC GCG TCA CCC AAA AAG CCT AAA CTT TCT CCT CAC ACT CAA AAC CCA GTA CCC GAT 324 
55 A A A S P K K P K L S V H T ^ U ? V ? D 7 5 

3 25 CCC AAT TTA CAC CAA AGA TTT CTC CAG AGA TTT CTC GAA CCC 7CG CCG GAG GAA TAT GTT 3 84 
75PNLHQRFLQR FLEPSPEEYV 95 

2 c 5 CCC GAA ACG TCA TCA TCG AGG AAA TAC ACA CCA TTC GAA CAG CAA GTG GTG GAG CTA AAG 444 

= 5 ? E T S S S R K Y T P L E Q Z- V V E L K 115 

445 AC-C AAG TAC CCA GAT GTG GTT TTG ATG GTG GAA GTT GGC TAC AGG TAC AGA TTC TTC GGA 504 

115 S K Y P D V V L M V E V G Y = Y R F F G 13 5 

525 GAA GAC GCG GAG ATC GCA GCA CGC GTG TTG GGT AT 7 TAC GCT' CAT ATG GAT CAC AAT TTC 5 64 

13 5 E D A S I A A R V L G I Y A H " C* H N F 15 5 

55 5 ATG ACG GCG AGT GTG CCA ACA TTT CGA TTG AAT TTC CAT GTG AGA AGA CTC GTG AAT GCA 62 4 

15 5 K T A S V P T F R L K ? H 7 ?. P. L V N A 17 5 

£25 GGA TAC AAG ATT GGT GTA GTG AAG CAG ACT GAA A "7 GCA GCC ATT AAG CCC CAT GGT GCA 634 

1~5 G Y K I G V V K Q T E T A A C ?I F H G A 195 

£35 AAC CGG ACC GGC CCT TTT TTC CGG GGA CTG TCG GCC- TTC- TAT ACC AAA GCC ACG CTT GAA 74 4 

15 6 U R T G ? F F R G C S A L Y T K A T L E 215 

74 5 GCG GCT GAG GAT ATA AGT GGT GGT TGT GGT GGT GAA GAA GGT TTT GGT TCA CAG AGT AAT 604 

215 A A E D I S G G C G G E E G F G £ Q S N 235 

='0 5 TTC TTG GTT TGT GTT GTG GAT GAG AGA GTT AAG TCC GAC ACA TTA GGC TCT GGT ATT GAA 8 54 



V C V V D E R V S E T C- G C G 1 



= 55 ATC- AGT TTT GAT GTT AGA GTC GGT GTT GTT GGC GTT GAA ATT TCG ACA GGT GAA GTT GTT 924 

2 5 5 :■: 3 F D V R V G V V G V E I S T G E V V 2 7 5 

-25 TAT GAA GAG TTC AAT GAT AAT TTC ATG AGA AGT GCA TTA CAC CCT GTG ATT TTG AG C TTG 934 

27 5 Y E E F K D N F M ?. S G L E A Y C L S L 2 95 

5 55 TCA CCA GCT GAG CTG TTG CTT GGC CAG CCT CTT TCA CAA CAA ACT GAG AAG TTT TTG GTG 10 4 4 

? A E L L> L G Q ? L £ Q Q T E Y. F L V 315 

104 5 GCA CAT GCT GGA CCT ACC TCA AAC GTT CGA GTG GAA CGT GCC TCA CTG GAT TGT TTC AGC 1104 

315 A H A G P T S N V R V E R A S L D C F S 32 5 

1105 AAT GGT AAT GCA GTA GAT GAG GTT ATT TCA TTA TGT GAA AAA ATC AGC GCA GGT AAC TTA 1164 

3 35 I,' G N A V D E V I S L C E K I S A G N L 355 

1155 GAA GAT GAT AAA GAA ATG AAG CTG GAG GCT GCT GAA AAA GGA ATG TCT TGC TTG ACA GTT 12 24 

355E D DK EMKL EAA EKGM S C L TV 375 

12 55 GAT AGA ATT- ?\TC . A AC aT - P . rri rr,f r^C ^CT GTT CAA GCC C T*" ^rr rTa ^rn ttt Tnr r^T I?p4 

376H T I M NM P H L TV QA L A L T F C H 395 



12 S 5 CTC AAA CAG TTT GGA TTT GAA AGG ATC CTT TAC CAA GGG GCC TCA TTT CGC TCT TTG TCA 1344 

396L KQFGFER I LYQGAS FR S LS 415 

13 4 5 AGT AAC ACA GAG ATG ACT CTC TCA GCC AAT ACT CTG CAA CAG TTG GAG GTT GTG AAA AAT 1404 
416 SNTEMTLSANTLQQLEVVKN 435 

14 05 AAT TCA GAT GGA TCG GAA TCT GGC TCC TTA TTC CAT AAT ATG AAT CAC ACA CTT ACA GTA 14 64 
436 NSDGSESGSLFHNMNHTLTV 455 

14 65 TAT GGT TCC AGG CTT CTT AGA CAC TGG GTG ACT CAT CCT CTA TGC GAT AGA AAT TTG ATA 1524 

456 YGSRLLRHWVTHPLC DRNLI 475 



15 25 TCT GCT CGG CTT GAT GCT GTT TCT GAG ATT TCT GCT TGC ATG GGA TCT CAT AGT TCT TCC 
476 SARLDAVSEISACMGSHSSS 



1584 
495 
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1585 CAG CTC AGC AGT GAG TTG GTT GAA GAA GGT TCT GAG *AGA GCA *A*TT GTA TCA* # CCT*GAG TTT 164 4 

496QLSSELVEEGSERAIVSPEF 515 

164 5 TAT CTC GTG CTC TCC TCA GTC TTG ACA GCT ATG TCT AGA TCA TCT GAT ATT CAA CGT GGA 17 04 

516 YLVLSSVLTAMSRSSDI Q R G 535 

17 05 ATA ACA AGA ATC TTT CAT CGG ACT GCT AAA GCC ACA GAG TTC ATT GCA GTT ' ATG GAA GCT 1764 

53 61 TR I F HRTA K A TEFI A V M EA 555 

17 55 ATT TTA CTT GCG GGG AAG CAA ATT CAG CGG CTT GGC ATA AAG CAA GAC TCT GAA ATG AGG 1824 

— S3£-I ~L™-b A G K 0 I O , R , L, G I K 0 D S S n M R 5_75_ 

182 5 AGT ATG CAA TCT GCA ACT GTG CGA TCT ACT CTT TTG AGA AAA TTG ATT TCT GTT ATT TCA 1884 

576 S M Q SATVRSTLLRKL I S V I S 595 

1885 TCC CCT GTT GTG GTT GAC AAT GCC GGA AAA CTT CTC TCT GCC CTA AAT AAG GAA GCG GCT 1944 

59 6 S P V V V D N A G K L L S A L M K E A A 615 

154 5 GTT CGA GGT GAC TTG CTC GAC ATA CTA ATC ACT TCC AGC GAC CAA TTT CCT GAG CTT GCT 2004 

516V R G D L L D I L I T S S D Q F ? E L A 635 

2005 GAA GCT CGC CAA GCA GTT TTA GTC ATC ACG GAA AAG CTG GAT TCC TCG ATA GCT TCA TTT 2 0 64 

63 5 E A R Q A V L V I ?. E K L D S S I A S F 655 

20 55 CGC AAG AAG CTC GCT ATT CGA AAT TTG GAA TTT CTT CAA GTS TCG GGG ATC ACA CAT TTG 2124 
555 R K K L A I ?. N L Z F L Q V S G I 7 H L 575 

212 5 ATA GAG CTG CCC GTT GAT TCC AAG GTC CTT ATG AAT TGG GTC- AAA GTA AAT AGC ACC AAG 2184 

5 1 5 1 Z L ? V D S K V ? M I] V? V K V M 3 T K 69 5 

21 £5 AAG ACT ATT CGA TAT CAT CCC CCA GAA ATA GTA GCT GGC TTG GAT GAG CTA GCT CTA GCA 2 2 44 
z ~ 5 K T I R V H ? P £ I V A G L D E L A L A 715 

22 4 5 ACT GAA CAT CTT GCC ATT GTG AAC CGA *J7? TCG TGG GAT AGT TTC CTC AAG AGT TTC AGT 2 3 04 
"Id T E H L A I V N R A S V. 1 D S F 1- K S ? S 73 5 

23 0 5 AGA TAC TAC ACA GAT TTT AAG GCT GCC GTT CAA GCT CTT GCT GCA CTG GAC TGT TTG CAC 2 3 64 
7 3 6 R. V y T D F K A A 7 Q A L A A L D C L H 7 5 5 

25 6 5 TCC CTT TCA ACT CTA TCT AGA AAC AAG AAC TAT GTC CGT CCC GAG TTT GTG GAT GAC TGT 24 24 

7 56 S L S T h S R N K K Y V R P E F V D D C 77 5 

24 2 5 GAA CCA GTT GAG ATA AAC ATA CAG TCT GST CGT CAT CCT GTA CTG GAG ACT ATA TTA CAA 24 84 
7 7 6 E P V 5 Z N I Q S 2- R H P V L E T I L Q 7 95 

24 S 5 GAT AAC TTC GTC CCA AAT C-AC ACA ATT TTG CAT GCA GAA GGC- GAA TAT TGC CAA A.TT ATC 2 544 
7 5 6 D N F V ? M D T I 2 H A E G E V C Q I I B15 

2 54 5 ACC GGA CCT .AAC ATG GGA GGA AAG AGC T2C TAT ATC CGT CAA. GTT GCT TTA ATT TCC ATA 2 6 04 

S 1 6 T G P N M G C- K S Z V I R Q V A L I S I 8 3 5 

2 605 ATG GCT CAG GTT GGT TCC TTT GTA CCA GCC- TCA TTC GCC AAG CTG CAC C-TG CTT GAT GGT 2 664 

£36 M A Q V G S ? V ? A S F A K L H V L D G 855 

25 5 5 GTT TTC ACT CGG A.TG GGT GCT TCA GAC AGT ATC CAG CAT GGC AGA AGT ACC TTT CTA GAA 27 24 
856V F T R M G A S D S I Q H G R S T F L E 875 

272 5 GAA TTA AGT GAA GCG TCA CAC ATA ATC AGA ACC TGT TCT TCT CGT TCG CTT GTT ATA TTA 2 7 84 

S76 ELSEASHIIRTCSSRSLVIL 895 

27 3 5 GAT GAG CTT GGA AGA GGC ACT AGC ACA CAC GAC GGT GTA GCC ATT GCC TAT GCA ACA TTA 2 844 

896 DELGRGTSTKDGVAIAYATL 915 



28 4 5 CAft CAT CTC CT7\ GCA GAA JVT i G TifT i TPT TTP ttTT PTT TTT P . TP RPG r&T Tar CCT BAA ATA 2904 
916 QHLLAEKRCLVLFVTHYPEI 935 



2905 GCT GAG ATC AGT AAC GGA TTC CCA GGT TCT GTT GGG ACA TAC CAT GTC TCG TAT CTG ACA 2T&T 

936 AEISNGFPGSVGTYHVSYLT 955 

2955 TTG CAG AAG GAT AAA GGC AGT TAT GAT CAT GAT GAT GTG ACC TAC CTA TAT AAG CTT GTG 3024 

956 LQKDKGSYDHDDVTYLYKLV 975 

302 5 CGT GGT CTT TGC AGC AGG AGC TTT GGT TTT AAG GTT GCT CAG CTT GCC CAG ATA CCT CCA 30 84 

976 RGLCSRSFGFKVAQLAQIPP 995 

30 85 TCA TGT ATA CGT CGA GCC ATT TCA ATG GCT GCA AAA TTG GAA GCT GAG GTA CGT GCA AGA 3144 

996 SCIRRAI SMAAKLEAEVRAR* 1015 

314 5 GAG AGA AAT ACA CGC ATG GGA GAA CCA GAA GGA CAT GAA GAA CCG AGA GGC GCA GAA GAA 3 204 

1016 ERNTRMGEPEGHEEPRGA EE 1035 



32 05 TCT ATT TCG GCT CTA GGT GAC TTG TTT GCA GAC CTG AAA TTT GCT CTC TCT GAA GAG GAC 
1036 SISALGDLFADLKFALSEED 



3264 
1055 
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figure 4 (continued):**: : ## : : ## 

c x • • ' • • ••••••••••• 

• • •••• • 

3 26 5 CCT TGG AAA GCA TTC GAG TTT TTA AAG CAT GCT TGG AAG ATT GCT GGC AAA %TC*AGA CTA* * 3 32*4* 

1056 PWKAFEFLKHAWKIAGKI RL 1075 

33 25 AAA CCA ACT TGT TCA TTT TGA TTTAATCTTAACATTATAGCAACTGCAAGGTCTTGATCATCTGTTAGTTGCG 3 3 97 
1076 K P T C S F 1082 

3 3 98 TACTAACTT ATG TGT ATT AGT ATA ACA AGA AAA GAG AAT TAG AG AG ATG GAT TCT AAT CCG 34 58 
1 MCISITRKEN* M D S H F 5 

34 59 GTG TTG CAG TAC ATC TTT TCT CCA CCC GCA TAA AAJ\AAAAAJLAAAAAAAAAAAAAAAA^^^ 3 522 
~6-¥ — 4r-~— Q ^.=-=-1 -J fi— P. -P A _._ - 1 6 
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Figure 7 



Comments/References: 43= 5' side of S8 (AIMSH6) 2182 bp in pUC18/Sma1 





Figure 8 



Comments/References: 62= 3' side rA S8 (AtM3H6, -,379bp in pUC18/Sma1 
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1 AAAAGTTGAGCCCTGAGGAGTATCGTTTCCGCCATTTCTACGACGCAAGGCG.--~_---.TTTTTGGCGCCAAT CTTTCCCCCC S 0 

SI TTTCGAATTCTCTCAGCTCAAAACATCGTTTCTCTCTCACTCTCTC7CACAA77CCAAA-AA ATC- CAG CGC CAG 13: 

1 • M Q ?. Q 4 

~"^^^AX5A^CG-A^P-*g^ , T.T 5, GTT TCC 213 

5R S I L S FFQK P TAATT KGL V 

214 GGC GAT GCT GCT AGC GGC GGG GGC GGC AGO GGA GGA CCA CGA TTT AAT GTG A-AG GAA GGG 27 2 

25 G D A A S G G G G S G G ? K F N V K EG 4 4 

27 4 GAT GCT AAA GGC GAC GCT TCT GTA CGT TTT GCT GTT TCG AAA TCT GTC GAT GAG GT7 AGA 3 33 

45 D A K G D A S V R F A V 3 F S V C E V R 64 

334 GGA ACG GAT ACT CCA CCG GAG AAG GTT CCG CGT CGT GTC CTG CCG TCT GGA TTT AAG CCG 3 93 

65GTDTPPEKVPRRYLP S G F K ? S4 

3 94 GCT GAA TCC GCC GGT GAT GCT TCG TCC CTG TTC TCC AAT ATT ATG CAT AAG TTT GTA AAA 4 52 
85 A E S A G D A S S L F S 3 M H F V K 104 

4 54 GTC GAT GAT CGA GAT TGT TCT GGA GAG AGG AGC CGA GAA GAT GTT GTT CCG CTG AAT GAT 512 
105 V D D R D C S G E k S ?. E C V V ? L N D 134 

514 TCA TCT CTA TGT ATG AAG GCT AAT GAT GTT ATT CCT CAA TTT CGT TCC AAT AAT GGT AAA 5~2 

125 S S L C M K A N D V I ? C_ F R S " IT G K 144 

57 4 ACT CAA GAA AGA AAC CAT GCT TTT AGT TTC AGT GGG AGA GCT GAA CTT AGA TCA GTA GAA 53 3 

145 T Q E R N H A F 3 F S G F. A E L ?. S V E 164 

63 4 GAT ATA GGA GTA GAT GGC GAT GTT CCT GC-T CCA GAA AC A CCA GGG ATG CGT CCA CGT GCT 5=3 

16 5 D I G V D G D V ? rj ? z 7 ? G M F. ? P. A 154 

694 TCT CGC TTG AAG CGA GTT CTG GAG GAT GAA ATG ACT TTT AAG GAG GAT AAG GTT CCT GTA "33 

185SRLKRVLEDEHT F F. E D ". V ? V I C 4 

7 54 TTG GAC TCT AAC AAA AGG CTG .AAA ATG CTC CAG GAT ICG GTT TGT GGA GAG AAG AAA GAA £12 

2C5 LDSNKRLK M L Q D ? V C G E r" K E 2.2 4 

614 GTA AAC GAA GGA ACC AAA TTT GAA TGG CTT GAG TCT TCT CC-A ATC AGG GAT GCC AAT AGA "3 

225 V 13 E G T K F E W L E S S ?. I R T A N R ~ 4 4 

£7 4 AGA CGT CCT GAT GAT CCC CTT TAC GAT AGA AAG AC T CTA CAC ATA CCA CCT GAT GTC TTC 32 2 

245 R R P D D PL V D F. K T 1 H I ? ? C V F 1 £ 4 



9 34 AAG AAA ATG TCT GCA TCA CAA AAG CAA TAT 
265 K K M S A S Q K Q V 

994 GTG CTT TTC TTT AAA GTG GGG AAA TTT TAT 
285 VLPPKVGKFY 

1054 CAC AAG GAG CTT GAC TGG AAG ATG ACC ATG 
305 H K E L D W K H T M 

1114 ATC TCT GAA AGT GGG ATA GAT GAG GCA GTG 
325 ISESGIDSAV 

117 4 GGA CGA ATC GAG CAG CTA GAA ACA TCT GAC 
345 GRIEQLETSD 



TGG AGT GTT AAG AGT GAA. TAT AT G GAT ATT ~ r 2 

W £ Y S 5 V :■: D I 3=4 

GAG CTG TAT GAG CTA GAT C-CG GAA TTA GGT 2 3 53 

E L V E L D A E L G 3 3 4 

AGT GGT GTG GGA AAA. TGC AGA CAG GTT GGT 1113 

S G Y G K C F Q V G 224 

CAA AA.G CTA TTA GCT CGT GGA TAT AAA GTT 1173 

QKLLARGVKV 344 

CAA GCA AAA GCC AGA GGT GCT AA.T ACT ATA 12 33 

Q A K A R G A N T I 3 54 



123 4 ATT CCA AGG AAG CTA GTT CAG GTA TTA ACT CCA TCA ACA GCA AGC GAG CGA AAC ATC GGG 12 93 

— T p p y t y o y l T P t w _T_& S E G K I _C3^ 3 84 

Z294 CCT GAT GCC GTC CAT CTT CTT GCT ATA AAA GAG ATC AAA ATG GAG CTA CAA AAG TGT TCA 13 53 

385 PDAVHLLAIKEIKMELQKCS 404 

13 54 ACT GTG TAT GGA TTT GCT TTT GTT GAC TGT GCT GCC TTG AGG TTT TGG GTT GGG TCC ATC 1413 
405 TVYGFAFVDCAALRFWVG SI 424 

1414 AGC GAT GAT GCA TCA TGT GCT GCT CTT GGA GCG TTA TTG ATG CAG GTT TCT CCA AAG GAA 147 3 

425 SDDASCAALGALLMQVSPKE 444 

14 74 GTG TTA TAT GAC AGT AAA GGG CTA TCA AGA GAA GCA CAA AAG GCT CTA AGG AAA TAT ACG 153 3 
445 VLYDSKGLSREAQKALRKYT 464 

153 4 TTG ACA GGG TCT ACG GCG GTA CAG TTG GCT CCA GTA CCA CAA GTA ATG GGG GAT ACA GAT 1593 

465 LTGSTAVQLAPV PQVMGDTD 484 



5 94 GCT GCT GGA r^IT AG A 
4 85 A A G V R 
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Figure 9 (contiifufed) # 'a^ : ". .... 

AAT ATA ATA GAA TCT AAC GGA TAC TTT AAA GO'S *TCT* J3AA TCA*»;6f?3*» 
N I I ESNGYrKG S S 2"* S *-50*. 
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2914 CAG GAT TCA CTA GTA ATC CTT GAC GAA CTG GGC AGA GGA ACT AGT ACT TTC GAT GGA TAC 2 9 73 

— o -n-r— n n , t ,t- - -t t r> g T a. R G T S T F D G Y 94_4_ 

297 4 GCC ATT GCA TAC TCG GTT TTT CGT CAC CTG GTA GAG AAA GTT CAA TGT CGG ATG CTC TTT 30 3 3 

945 AIAYSVFRHLVEKVQCRMLF 964 

303 4 GCA ACA CAT TAC CAC CCT CTC ACC AAG GAA TTC GCG TCT CAC CCA CGT GTC ACC TCG AAA 30 93 

965 ATHYHPLTKEFASHPRVTSK 984 

3094 CAC ATG GCT TGC GCA TTC AAA TCA AGA TCT GAT TAT CAA CCA CGT GGT TGT GAT CAA GAC 3153 

985 H^MACAFKSRSDYQPRGCDQD 1004 

3154 CTA GTG TTC TTG TAC CGT TTA ACC GAG GGA GCT TGT CCT GAG AGC TAC GGA CTT CAA GTG 3213 

1005 LVFLYRLTEGACPESYGLQV 1024 

3214 GCA CTC ATG GCT GGA ATA CCA AAC CAA GTG GTT GAA ACA GCA TCA GGT GCT GCT CAA GCC 3273 

1025 ALMAGIPNQVVETASGAAQA 1044 
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Figure 11 







Complete genomic DNA. sequence, 8062 bp, of AtMSH6 



TTTTTTGGTTGCTAACAATAAAGGTATACGGTTTTATGTCATCAAT ATAA 5 0 

CTATATATAAAAGAAATGAAAGATATATATTGTTTTTTCATTTATC AAAC 10 0 

~A7ffiA"CAACAAGACTTTTTTTTTA*CTT^ ~~ TSXT 

GATAAACGACATCGTTTAATCATTTCCCAATTTTACCCCTAAGTTTAACA 2 00 

CCTAGAACCTTCTCCATCTTCGCAAGCACAGCCTGATTAGGAACAGCTTT 2 5 0 

ACCATTCTCATATTCCTGAACTACCTGAGTCCTCTCATTGATCTGTTTCG 3 0 0 

CCAAATCCGCTTGTGACATCTTCTTCTCCAATCTCGCTTTCTGTATCATC 3 5 0 

AACCTCACCTCTGCTTTCACACGATCCATCGCCGCAGGCTCTGTTTCTTC 4 0 0 

TTCCAGCTTCTTCGTGTTAATCACCGGAACCGCCGTAGATTTCCCCTTTT 4 50 

TGTTCGAACCGGCATCGAATTTCTTAACCGTTTGAACCGCGACACCGTTT 5 0 0 

CTCAGAGCTGCGTTAACCGCTTTCGGATCGCGTAGGTCTTGGCTCTTTTG 5 50 

TTTTGATTTGTGGAGAACTACTGGTTCCCAGTCTTGTGTTACTGCTCCTG 6 0 0 

GGTATCTGCTCGGCATCGTCGATGAATTGAGAGAAAGGAACAACGCGAAA 6 5 0 

ATTTTATTAATCTGAGTTTTGAAATTGAGAAACGATGAAGATGAAGAATG 7 0 0 

TTGTTGAGAGGATTGTGATATTTATATATACGAAGATTGGTTTCTGGAGA 7 5 0 

ATTCGATCATCTTTTTCTCCATTTTCGTCTCTGGAACGTTCTTAGAGATG 8 0 0 

ATTGACGACGTGTCATTATCTGATTTGCAGTTAACCAATGCTTTTTGGGT 8 5 0 

TGGATTCGTGGTACACCATATTATCCGATTTGGCTCAATGGTTTTATATA 9 0 0 

AATTTGGTTTTCGGTTCGGTTATGAGTTATCATTAAAATTAAGCTAACCA 95 0 

AAAATTTTCGTAAAATTTATTTCGGTTTCAATTCGGATCCCTTACTTCCA 10 0 0 

GAACCGAATTATTCGAAACCGGGGTTAGCCGAACCGAATACCAATGCCTG 10 5 0 

ATTGACTCGTTGGCTAGAAAGATCCAACGGTATACAATAATAGAACATAA 110 0 

ATCGGACGGTCATCAAAGCCTCAAAGAGTGAACAGTCAACAAAAAAAGTT 115 0 

GAGCCCTGAGGAGTATCGTTTCCGCCATTTCTACGACGCAAGGCGAAAAT 12 0 0 

TTTTGGCGCCAATCTTTCCCCCCTTTCGAATTCTCTCAGCTCAAAACATC 12 5 0 

GTTTCTCTCTCACTCTCTCTCACAATTCCAAAAAATGCAGCGCCAGAGAT 13 0 0 

CGATTTTGTCTTTCTTCCAAAAACCCACGGCGGCGACTACGAAGGGTTTG 13 5 0 

GTTTCCGGCGATGCTGCTAGCGGCGGGGGCGGCAGCGGAGACCACGATTT 14 0 0 

AATGTGAAGGAAGGGGATGCTAAAGGCGACGCTTCTGTACGTTTTGCTGT 14 5 0 

TTCGAAATCTGTCGATGAGGTTAGAGGAACGGATACTCCACCGGAGAAGG 15 0 0 

TTCCGCGTCGTGTCCTGCCGTCTGGATTTAAGCCGGCTGAATCCGCCGGT 15 5 0 

GATGCTTCGTCCCTGTTCTCCAATATTATGCATAAGTTTGTAAAAGTCGA 16 0 0 

TGATCGAGATTGTTCTGGAGAGAGGTACTAATCTTCGATTCTCTTAATTT 16 5 0 

TGTTATCTTTAGCTGGAAGAAGAAGATTCGTGTAATTTGTTGTATTCGTT 17 0 0 

GGAGAGATTCTGATTACTGCATTGGATCGTTGTTTACAAATTTTCAGGAG 17 5 0 

CCGAGAAGATGTTGTTCCGCTGAATGATTCATCTCTATGTATGAAGGCTA 18 0 0 

ATGATGTTATTCCTCAATTTCGTTCCAATAATGGTAAAACTCAAGAAAGA 18 5 0 

AACCATGCTTTTAGTTTCAGTGGGAGAGCTGAACTTAGATCAGTAGAAGA 1900 

TATAGGAGTAGATGGCGATGTTCCTGGTCCAGAAACACCAGGGATGCGTC 1950 

CACGTGCTTC TCGCTTGAAGCGAGTTCTGGAGGATGAAATGACTTTTAAG 2000 

GAGGA'rAAGGTTCCTGTATTGGACTCTAACAAAAGGCTGAAAATGCTCCA 2050 

GG AT C CGG.T-TTGTGGAGAGAAGAAAG AAGT AAACG A AGGA A C.C A AA^CTG 3-1-0-0- 

AATGGCTTGAGTCTTCTCGAATCAGGGATGCCAATAGAAGACGTCCTGAT 215 0 

GATCCCCTTTACGATAGAAAGACCTTACACATACCACCTGATGTTTTCAA 22 0 0 

GAAAATGTCTGCATCACAAAAGCAATATTGGAGTGTTAAGAGTGAATATA 2 2 5 0 

TGGACATTGTGCTTTTCTTTAAAGTGGTTAGTAACTATTAATCTAGTGTT 2 3 0 0 

CAATCCATTTCCTCAATGTGATTTGTTCACTTACATCTGTTTACGTTATG 2 3 5 0 

CTCTTCTCAGGGGAAATTTTATGAGCTGTATGAGCTAGATGCGGAATTAG 24 0 0 

GTCACAAGGAGCTTGACTGGAAGATGACCATGAGTGGTGTGGGAAAATGC 24 5 0 

AGACAGGTAAATTAGTTGAAACAACTGGCCTGCTTGAATTATTGTGTCTA 2 5 00 



Figure 11 (Continued) 



TAAATTTTGACACCACCTTTTGTTTCAGGTTGGTATCTCTGAAAGTGGGA 2 5 5 0 

TAGATGAGGCAGTGCAAAAGCTATTAGCTCGTGGGTAAGGGAACCATCAT 2 6 0 0 

ACTTTATGGAATTCGTTTACTGCTACTTCGGCTAGGATTTAAGAAATGGA 2 6 5 0 

AATCACTTCAAGCATCATTAGTTAGGATCCTGAGAACTCAGGATGTTTTC 2 7 0 0 

_ -TTAXXCGTXA TATAATAA GTCTTTTC ATCAAGGAGT AAC AAACAAAACTT 2 7 5 0 

GCACAATATTTGTGTGCTCACTGGCAAGGCATATATACCCAGCTAACCTT 2800 

TGCTAGTTCACTGTAGTAACAGTTACGGATAATATATGTTTACTTGTATG 2 8 50 

TGGTACCCTCATTTT.GTCTCTCATGGAGGCTTTCAAGCCTTGTGTTGAAA 2 9 0 0 

CTGGATAGTTACATATGCTTCCAACAGAAACTAGCATGCAGATTCATATG 2 9 5 0 

CTTTCCTATTCTACTAATTATGTATTGACACACTCGTTGTTTCTTTTGAA 3 0 00 

AGATATAAAGTTGGACGAATCGAGCAGCTAGAAACATCTGACC AAGCAAA 3 0 5 0 

AGCCAGAGGTGCTAATACTGTAAGTTTTCTTGGATAGGTCAAGGAGAGTG 3100 

TTGC AGACTGTTTTTG ATC ATTTCTTTTTCTGT AC ATTACTTTCATGCTG 3150 

TAATTAACTCAATGGCTATTCTGGTCTGATTATCAGATAATTCCAAGGAA 3 2 00 

GCT AGTTC AGGTATT AACTCC ATC AAC AGCAAGCGAGGGAAAC ATCGGGC 3 2 50 

CTGATGCCGTCCATCTTCTTGCTATAAAAGAGGTTTGTTATTTACTTATT 3 3 00 

TATCTTATCATGTTCAGTTCATCCAAGTCCTGAAAAATTACACTCTTCTT 3 3 50 

TACCAATCTTCCATCAAGCTGTGTAAAGGATTTGGAATTAGAAAATCATT 3 4 0 0 

ATTTGATGCTTTGTTTTATATGCAAGAGGTTCCCTTGAAAAGATCTGTTT 3 4 5 0 

AAGATTCTTTGCACTTGAAAAATTCAATCTTTTTAAGTGAATCCCCTACT 3 5 0 0 

TTCTTACAATGATCATAGTCTGCAATTGCATGTCAAGTAATATCATTCCT 3 5 50 

TGTTACTGCATCCCCCTCTTTCTTAATGACCATTGTCTATGTTGTGTTTG 3 6 0 0 

T CTCGTGTG CTGG AG AAAATG ATAG CTGAT C C AAGCTGT AC ATT ATC ATG 3 6 5 0 

ATTAAGTAGCTGCTCAGGAATTGCCTTTGGTTACATTGCCTAATGGTTTG 3 7 0 0 

ATGTCAATTTTTCTTCTGAATCTTTATTTTAGATCAAAATGGAGCTACAA 3 75 0 

AAGTGTTCAACTGTGTATGGATTTGCTTTTGTTGACTGTGCTGCCTTGAG 3 8 0 0 

GTTTTGGGTTGGGTCCATCAGCGATGATGCATCATGTGCTGCTCTTGGAG 3 8 5 0 

CGTTATTGATGCAGGTAAGCAAGTGTATTCTGTATCTTATGTGTACCATG 3 3 0 0 

TGACTTCCTGTGCATATATTTGGGTTGCAGGAACTAATTCTGAATCACCA 3 95 0 

TTTGGTATGTTTTTTCCAGGTTTCTCCAAAGGAAGTGTTATATGACAGTA 4 0 0 0 

AAGGTAAACTGCTTGTATCGCCAGTTGTTTTGTTAAACAGAATTTAAGGT 4 0 5 0 

AAATGACACTGGTTAATTTAAAGTGCATACATGTTGAAATATTGCAGGGC 410 0 

TATCAAGAGAAGCACAAAAGGCTCTAAGGAAATATACGTTGACAGGTACC 415 0 

ATTTCAGTAGGCAAGCTAACTGACAATTTAACCGCTCACCGAATGATAGG 4 2 0 0 

TCTCTTAAACATTGCTAATGTAGATGATGTTTATGTTTCAATCTAATAGG 4 2 5 0 

GTCTACGGCGGTACAGTTGGCTCCAGTACCACAAGTAATGGGGGATACAG 4 3 0 0 

ATGCTGCTGGAGTTAGAAATATAATAGAATCTAACGGATACTTTAAAGGT 4 3 5 0 

TCTTCTGAATCATGGAACTGTGCTGTTGATGGTCTAAATGAATGTGATGT 44 0 0 

TGCCCTTAGTGCTCTTGGAGAGCTAATTAATCATCTGTCTAGGCTAAAGG 4 4 5 0 

TGTGTTGGCTTGTTTAGTTTTTGCTTTTC ACAAATTAAGCAAAGGAACTT 4 50 0 

TTCATAACTTACAGTTTCTATCTACTTGCAGCTAGAAGATGTACTTAAGC 4 55 0 

ATGGGG ATATTTTTCCATACCAAGTTTACAGGGGTTGTCTCAGAATTGAT 4 6 0 0 

cc i C-CAC i AC.C i ATCzciT' ix a 7i TP T Tr; a n a t a t tt r AC-^^^^ ^ Tnr.Tnn 4650 

TCCTTCAGGCAAGTGCATATTTCTTTTTTGATAACTTCAACTAGAGGGCA 4 7 0 0 

"GACAT AGAAGGAAXAATTCTAA — 4~7"5-0' 

AGCCGATTTTTGTTTACCTATGTAGGGACCTTGTACAAATATCTTGATAA 4 80 0 

CTGTGTTAGTCCAACTGGTAAGCGACTCTTAAGGAATTGGATCTGCCATC 4 85 0 

CACTCAAAGATGTAGAAAGCATCAATAAACGGCTTGATGTAGTTGAAGAA 4 90 0 

TTCACGGCAAACTCAGAAAGTATGCAAATCACTGGCCAGTATCTCCACAA 4 95 0 

ACTTCCAGACTTAGAAAGACTGCTCGGACGCATCAAGTCTAGCGTTCGAT 500 0 

CATCAGCCTCTGTGTTGCCTGCTCTTCTGGGGAAAAAAGTGCTGAAACAA 5 05 0 

CGAGTAAGTATCAATCACAAGTTTTCTGAGTAATGCCTTCCATGAGTAGT 510 0 

ATAGGACTAAAACATTACGGGTCTAGCTAAAGACTGTTCTCCTTCTTTTG 515 0 
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CAATGTCTGGTTATTCATTACATTTCTCTTAACTTATTGCATTGCAGGTT 5 2 0 0 

AAAGCATTTGGGCAAATTGTGAAAGGGTTCAGAAGTGGAATTGATCTGTT 5 2 5 0 

GTTGGCTCTACAGAAGGAATCAAATATGATGAGTTTGCTTTATAAACTCT 5 3 0 0 

GTAAACTTCCTATATTAGTAGGAAAAAGCGGGCTAGAGTTATTTCTTTCT 5 3 5 0 

CAATTCGAAGCAGCCATAGATAGCGACTTTCCAAATTATCAGGTGCCCAT 54 0 0 

— 'CT AT CTTT C ATACTTTA'CAA^AAS^^GTCTGTCACTACTC A^^GCAATG " — ~ 5T5TT 

ATATGGCTTAGATCTCAACTCACACCCCGAGGATCCTAAAGGGATTTGCT 5 5 0 0 

TTTTATTCCTAATGTTTTTGGATGGTTTGATTTATTTCTAACTTGAACTT 5 55 0 

ATTAATCTTGTACCAGAACCAAGATGTGACAGATGAAAACGCTGAAACTC 5 6 0 0 

TCACAATACTTATCGAACTTTTTATCGAAAGAGCAACTCAATGGTCTGAG 5 6 5 0 

GTCATTCACACCATAAGCTGCCTAGATGTCCTGAGATCTTTTGCAATCGC 5 70 0 

AGCAAGTCTCTCTGCTGGAAGCATGGCCAGGCCTGTTATTTTTCCCGAAT 5 7 5 0 

C AG AAG CT AC AG AT C AG AAT C AG AAAA C AAAAGG G C C AAT A CT T AAAAT C 5 8 0 0 

CAAGGACTATGGCATCCATTTGCAGTTGCAGCCGATGGTCAATTGCCTGT 5 8 5 0 

TCCGAATGATATACTCCTTGGCGAGGCTAGAAGAAGCAGTGGCAGCATTC 5 9 0 0 

ATCCTCGGTCATTGTTACTGACGGGACCAAACATGGGCGGAAAATCAACT 5 9 5 0 

CTTCTTCGTGCAACATGTCTGGCCGTTATCTTTGCCCAAGTTTGTATACT 6 0 0 0 

CGTTAGATAATTACTCTATTCTTTGCAATCAGTTCTTCAACATGAATAAT 6 0 5 0 

AAATTCTGTTTTCTGTCTGCAGCTTGGCTGCTACGTGCCGTGTGAGTCTT 610 0 

GCGAAATCTCCCTCGTGGATACTATCTTCACAAGGCTTGGCGCATCTGAT 615 0 

AGAATCATGACAGGAGAGAGTAAGTTTTGTTCTCAAAATACCAATTCCTC 6 2 0 0 

GAACTATTTACTCAGATTTTGTCTGATTGGACAAGGTGGTTTTGCTTTTT 6 2 5 0 

TTTAGGTACCTTTTTGGTAGAATGCACTGAGACAGCGTCAGTTCTTCAGA 6 3 0 0 

ATGCAACTCAGGATTCACTAGTAATCCTTGACGAACTGGGCAGAGGAACT 6 3 5 0 

AGTACTTTCGATGGATACGCCATTGCATACTCGGTAACCTGCTCTTCTCC 6 4 0 0 

TTCAACTTATACTTGTTGATCAACAAAAACATGCAATTCATTTTGCTGAA 6 4 5 0 

ACTTATTGATTTATATCAGGTTTTTCGTCACCTGGTAGAGAAAGTTCAAT 6 5 0 0 

GTCGGATGCTCTTTGCAACACATTACCACCCTCTCACCAAGGAATTCGCG 6 55 0 

TCTCACCCACGTGTCACCTCGAAACACATGGCTTGCGCATTCAAATCAAG 6 6 0 0 

ATCTGATTATCAACCACGTGGTTGTGATCAAGACCTAGTGTTCTTGTACC 6 6 5 0 

GTTTAACCGAGGGAGCTTGTCCTGAGAGCTACGGACTTCAAGTGGCACTC 6 7 0 0 

ATGGCTGGAATACCAAACCAAGTGGTTGAAACAGCATCAGGTGCTGCTCA 6 75 0 

AGCCATGAAGAGATCAATTGGGGAAAACTTCAAGTCAAGTGAGCTAAGAT 6 8 0 0 

CTGAGTTCTCAAGTCTGCATGAAGACTGGCTCAAGTCATTGGTGGGTATT 6 8 5 0 

TCTCGAGTCGCCCACAACAATGCCCCCATTGGCGAAGATGACTACGACAC 6 9 0 0 

TTTGTTTTGCTTATGGCATGAGATCAAATCCTCTTACTGTGTTCCCAAAT 6 9 5 0 

AAATGGCTATGACATAACACTATCTGAAGCTCGTTAAGTCTTTTGCTTCT 7 0 0 0 

CTGATGTTTATTCCTCTTAAAAAATGCTTATATATCAAAAAATTGTTTCC 70 50 

TCGATTATAACAAGATTATATATGTATCTGTCGGTTTAGCTATGGTATAT 710 0 

AATATATGTATGTTCATGAGATTGGTCAAGAGAAATACTCACAAACAGTA 715 0 

TATTAAGAAGGAAATATGTTTATGCATTAATTTAAGTTTCAAGATAAACT 7 2 0 0 

GC AAATAACCTCGACTAAAGTTGCAAAGACCAAACACAAATTACAAAACT 72 5 0 

TATAAGACTTAAGTTCTGAATTCCCTAAAACCAAAAAAAAAAACAGAACA 7300 

TATTTTGTTGCATCTACAAACAACACAAACCTACATAGTTTATAACTTAC 7 3 5 0 

_:T-C-AT-CarT-riAnAT,T_AACAT-CAG^ 

CTCATCATCATCACCACCACCATGATGATTCTCCTCCTCTTCACGTAACC 74 50 

TAGCAATCTCACTCTGAGCTCTATCAACAATCTGCTTCTTCTGCAACTCC 75 0 0 

AAATCTCTCTGAAAATCAGCTCTCATCTTCTCCAACTCCTTCATTTGCTC 7 55 0 

TTTCTTACTCTTCTCCATCTTCTCATAAACCTTCCCAAACCTCTCAACAG 7 6 0 0 

AATCCGCCAACATCTTATACGAAGCAGCGTCATTAACCTTCTTCCTCTCG 76 50 

TACTCAACCTCATCATCCTCATCCTCCTCCTCTTCAGAATCACCAGGACT 7 70 0 

ATCCATCATCTCATCAAACCCATTAGACTTATCTAAATAAACCTTAGTGT 77 50 

T C AT AAAC AC AAACT C AC CTGAAT C AAC AC C AC AAG CT AAAC CT AAAT C C 78 0 0 



1 8 /'2 3 

Figure 11 (CohtinuSd 



9 



GACTTGGGCGAAACACAAAGCAACATATCCAACTTATTGAAAAACGACCA 78 5 0 

TTTACTTGAACCTAAACCTGATTTCTCAACCTTAATCTTCTCTTTTCTAT 7 9 0 0 

ACTTCCTCTTCAAGTCATCAATCATTCTCCTACATTGCGTCTCAGATTTC 7 9 5 0 

TCCATCCTTAGCTCCTCACTCACTTTCTCAGCTACTTCATTCCAATCCTC 80 00 

GTTCCTCAAACTCCTTCTACCCAATTGCAAAAACCTATCTCCCCAAACTT 8 0 5 0 

-eftAG€AA€AGAA=™ — - ™, " • . a06 2 




Figure 12 



Comments/References: S5 3' side antisense : : 3 t:Dm pUC18/13 Sal1/Sst1/T4 
into 85/35S BamH1/T4 in Agrobacterium LBA44C4 
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Figure 13 



Comments/References: S8 3' side antisens : 62 Sall/Sst1/T4 into 85 35S/BannH1/T4 
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Figure 14 



Comments/References: 85 35S S8 55 antisenss : S8 3' side (62) Sal1/3st1/T4 
into pPFl3 (85 35S S5 3' side antisens)/Sma1 . in LBA4404 
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Figure 15 



Comments/References: 85 35S S5. 38 antisens (D) : S5 3' side (13) Sal1/Sst1/T4 
into pPF14 (85 35S S8 3' side antisense). in LBA4404 
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Figure 16 
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